Speaker independent speech recognition method using constrained time alignment near phoneme discriminative frame

نویسندگان

Tomohiro Konuma

Tetsu Suzuki

Maki Yamada

Yoshio Ono

Masakatsu Hoshimi

Katsuyuki Niyada

چکیده

We present constrained time alignment acoustic models based on phonetic knowledge and a speaker independent speech recognition method using our proposed models. Japanese syllable and isolated word recognition experiments show that the models have robustness to intraand interspeaker varieties such as acoustic diversity. Furthermore we experiment with word recognition tests under the condition such as noise environments and endpoints free matching, it reveals the feasibility of our proposed models.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Text-Independent Speaker Verification via State Alignment

To model the speech utterance at a finer granularity, this paper presents a novel state-alignment based supervector modeling method for text-independent speaker verification, which takes advantage of state-alignment method used in hidden Markov model (HMM) based acoustic modeling in speech recognition. By this way, the proposed modeling method can convert a text-independent speaker verification...

متن کامل

Designing a Speaker-discrim Filter Bank for Speake

A new filter bank approach for speaker recognition front-end is proposed. The conventional mel-scaled filter bank is replaced with a speaker-discriminative filter bank. Filter bank is selected from a library in adaptive basis, based on the broad phoneme class of the input frame. Each phoneme class is associated with its own filter bank. Each filter bank is designed in a way that emphasizes disc...

متن کامل

SABR: sparse, anchor-based representation of the speech signal

We present SABR (Sparse, Anchor-Based Representation), an analysis technique to decompose the speech signal into speaker-dependent and speaker-independent components. Given a collection of utterances for a particular speaker, SABR uses the centroid for each phoneme as an acoustic “anchor,” then applies Lasso regularization to represent each speech frame as a sparse non-negative combination of t...

متن کامل

Discriminative training for continuous speech recognition

Discriminative training techniques for Hidden Markov Models were recently proposed and successfully applied for automatic speech recognition In this paper a discussion of the Minimum Classi cation Error and the Maximum Mu tual Information objective is presented An extended reesti mation formula is used for the HMM parameter update for both objective functions The discriminative training me thod...

متن کامل